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Abstract 

Regenerating codes allow distributed storage systems to recover from the loss of a storage node while transmitting 
the minimum possible amount of data across the network. We present a systematic computer search for optimal 
systematic regenerating codes. To search the space of potential codes, we reduce the potential search space in several 
ways. We impose an additional symmetry condition on codes that we consider We specify codes in a simple alternative 
way, using additional recovered coefficients rather than transmission coefficients and place codes into equivalence 
classes to avoid redundant checking. Our main finding is a few optimal systematic minimum storage regenerating 
codes for n = 5 and fc = 3, over several finite fields. No such codes were previously known and the matching of the 
information theoretic cut-set bound was an open problem. 

I. Introduction 

Erasure codes can be used in storage systems to efficiently store data while protecting against failures much more 
efficiently than replication. We can divide a file of size M into k pieces, each of size M/k, encode them into 
n coded pieces using an (n, fc) maximum distance separable (MDS) code, and store them at n nodes. Then, the 
original file can be recovered from any set of k coded pieces. This is optimal in terms of the redundancy-reliability 
tradeoff because fc pieces, each of size M./k, provide the minimum data for recovering the file, which is of size 
M. 

In practical distributed storage systems based on (n, fc) MDS codes, we are often faced with the repair problem f\\: 
If a node stoiing a encoded piece fails or leaves the system, in order to maintain the same level of reliability, we 
need to create a new encoded piece and store it at a new node, but we can only access other encoded blocks. One 
straightforward way to do so is to let the new node download fc encoded pieces from a subset of the surviving 
nodes, reconstruct the original file, and compute the needed new coded piece. In this process, the new node incurred 
a total network traffic of ^naive = k x M/k ^ M. 

Recent prior work |T| showed that it is possible to reduce this repair bandwidth below A4 and developed 
information theoretic lower bounds and achievable schemes. At this point, need to distinguish between two different 
repair problems: In this paper we consider the problem of systematic repair ||2| (also called exact repair |3l) where 
we require that it is exactly the same block that is reconstructed after a failure. This is in sharp contrast to functional 
repair i.e. only requiring that the new block is linearly independent and hence forms a good erasure code jointly 
with the other existing blocks |[T|. Systematic repair is a strictly harder problem, which however is of great practical 
interest since in most practical storage systems reading parts of the data is the most common operation and it should 
not require decoding of blocks if no failures have occuiTed (see also |4| for a practical analysis). 

As was shown in [l], the functional repair problem is equivalent to a multicasting problem on an information 
flow graph that adds all reconstruction points as virtual data collectors who demand all the data. Using cut-set 
arguments (which are achievable for multicasting |5|, 16|) we can determine the minimum repair bandwidth for 
MDS codes (codes matching this bound are called Minimum Storage Regenerating codes 11]): 

Mn~l 

IMSR = -, T, (f) 

fc n — fc 

if the new node is allowed to connect to d — n — 1 surviving nodes, after one failure. Note that throughout this 
paper, we are only considering the minimum storage point and we do not address other points in the storage-repair 
tradeoff curve of 

The systematic repair problem, however, is equivalent to a network coding problem where there are receivers 
who want all the data (the data collectors) and receivers who want subsets of the data (the nodes who will replace 
the failed ones are now also sinks with a demand of the lost blocks). This reduction shows exactly why systematic 




Fig. 1. An optimal (5,3) systematic MSR code over GF(3). We show the repair of the first node. The key property is that in the last two 
packets communicated, the coefficients of (03,63) are aligned (both are (1,0)) and at the same time, the coefficients of (02,62) are aligned 
(since (2, 1) = 2 X (1, 2) mod 3. This allows for only two extra blocks (2a2 + 62) and (^13) to suffice for four linear equations that can be 
solved in the desired variables (ai,6i). The rotational symmetry of the code allows all node failures to be recovered similarly 



repair is a much harder problem and careful coefficient selections are required. Further, the cut-set bound [T] is no 
longer necessarily tight and the optimal systematic repair rates are unknown for general {n,k). Recent work ||2l 
has developed an achievable scheme that is based on aligning the undesired subspaces, similarly to recent ideas for 
the interference channel (see e.g. Ii7j|) that have an achievable repair rate of 

M (fc^l)(n^fc) + 1 
liA = -T ; > (2) 

achieved by sub-packetizing each packet into q = n — k blocks of size M-jkq and communicating a total of 
(/c — 1)(7 + 1 blocks from d = n — 1 surviving nodes. 

It is easy to verify that the achievable rate (|2]i is matching the cut-set lower bound ([T]i for fc = 2 and k ~ n—\ 
but the other cases remain unknown. In this paper we present a searching approach to find systematic MSR codes 
that match the information theoretic lower bound ([T]l. Our search found some optimal systematic (5, 3) MSR codes 
(the existence of which was previously unknown), the simplest of which is shown in figure |l] The key property 
that allows optimality is that when one of the undesired subspaces is aligned (as done in the scheme of [2|), the 
other is also aligned because of the selection of coefficients of the code. This remarkable property is only possible 
if the code coefficients are carefully chosen and is closely linked to the size of the finite field. 

To search the space of potential codes in feasible amounts of time, we reduce the search space in several ways. 
We impose an additional condition that restricts the type of codes that we consider This allows us to consider only 
highly symmetric codes that can be more concisely specified. We specify a code in a simple alternative way, using 
additional recovered coefficients rather than transmission coefficients. The space of codes can be searched more 
easily and efficiently when codes are specified this way. Finally, we use linear transformations to relate codes to 
each other and place them into equivalence classes. This allows us to check only one code from each equivalence 
class. 

II. Definitions and Notation 

The storage networks that we are concerned with contain n equivalent storage nodes. We wish to store M. bits 
of data in the network, where M. k times the size of one of the storage nodes. Because of this, we say that the 
network has k source nodes. 



A. Lower hound on recovery bandwidth 
data im. We are interested in the case where d = n — 1, so this bound becomes fc(„"„fc) ■ There are n — 1 nodes 



During the recovery process, ^.^-^^l^.^, bits of data must be transmitted, where d is the number of nodes providing 



that each contain so each node is transmitting of its contents. Because of this, we store n — k packets of 
data in each storage node. We break the source data up into packets of the same size and each storage packet will 
be some linear- combination of the k{n — k) packets of source data. 

B. Notation 

We use several matrices to represent the data and coefficients used in an MSR code. 



A, 


(n- 


k) 


X 


fc(n - 


-k) 


matrix of storage coefficients 




1 




X 


(n- 


k) 


row vector of transmission coefficients 


c, 


(n- 


k) 


X 


(n- 


1) 


matrix used to rebuild storage node i 


D 


k(n - 


-k) 


X 


X 




matrix of source data 



The ith storage node contains A^D, the original data multiplied by the storage coefficients for that node. 
C. Independence 

The storage nodes of the code are independent if any k nodes can reproduce the original data. That is, for all 
combinations of k storage nodes, there is a matrix M such that 
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for any value of D. An equivalent condition is that for each combination of k nodes, the matrix of storage coefficients 
must have full rank, i.e. a nonzero determinant. 

D. Recovery 

When node j fails, the ith node transmits A^D. The code allows the recovery of node j if there is a matrix 
Cj that recreates the lost node from the transmitted vectors: 

/ Bi,,Ai \ 



= C, 



Bj+ijAj+i 



D 



(4) 



for any value of D. Therefore, D drops out of both the independent and recovery conditions, and we can focus on 
the coding coefficients only. We can also ignore the matrices because from the recovery condition we can see 
that in a working code the matrices are fully specified by the A^ and j matrices. With these two conditions, 
we can determine whether a set of A^ and B^ j matrices form a code. 

III. ROTATIONALLY SYMMETRIC CODES 

To reduce the total number of coefficients, we consider codes whose A^ matrices are related to each other by a 
simple transformation. 

Let R be an k{n — k) x k{n — k) matrix such that 



R" 



(5) 



and let 



AR' 



(6) 



A discussion of the R matrices themselves can be found in |8|. This reduces the number of storage coefficients 
needed to specify a code by a factor of n, reducing the search space exponentially. 



A. Recovery Condition 

This makes the recovery condition 
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A = C, 
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V B„,,AR" / 
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Bj_i jAR" ^ 

V B„,,AR"-^ J 

We can replace Bj ^ with Bj_j, reorder the rows of the transmitted coefficient matrix, and replace Cj with C. 
Now there is only one recovery condition. 



A = C 
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V B„_iAR"-i / 



(9) 



This is an improvement of a factor of n. 

B. Independence Condition 

Similarly, when checking independence, we only need to check combinations that include the first node. 
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This reduces the number of conditions from 

C. Example 
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The Bj matrices gives us the transmitted vectors. 

B1A2 = (1 ) 



10 
11 

= (0100) 



(11) 

(12) 

(13) 
(14) 



B2A3 = ( 1 ) ( ? ^ ; ? ) (15) 



10 1 

= (0010) (16) 

B3A, = (0 1 )(;;;; ;) hd 

= (1100) (18) 
From these we can complete the code by calculating C. 



Ai = C B2A3 (19) 
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(20) 



C = , , n (21) 



IV. Additional Recovered Coefficients 

The 'Bij matrices cannot be eliminated in a similarly simple manner, but their contribution to the code to be 
represented in alternative way. During recovery n — 1 vectors are transmitted to the lost node, but the original A, 
matrix has only n — k rows. Thus k — 1 additional vectors of coefficients are recovered. Specifying these vectors 
allows the j matrices to be determined. 

Let Zj be the fc — 1 X k{n — k) matrix that contains the additional rows recovered when node j is lost. Let 

be the n — 1 X A;(n — fc) matrix that contains all of the rows recovered when node j is lost. Then Xj(XjXj)^^Xj 
projects vectors into spanXj. A row vector v is in spanXj if the projection does not change the vector, or 

v(Xj(X,Xj)-iX,) = V. (23) 

This can be rewritten as 

v(I-Xj(X,Xj)-%)=0. (24) 

I — Xj(XjXj)~^Xj gives the difference between the original vector and the projection. This is a projection 

to the (fc — l)(r?, — fc — 1) -dimensional space F"/ spanXj. The only potentially useful vectors to transmit during 
recovery are those in spanXj, so we need to ensure that the transmitted vector Bj^Aj must satisfy 

B,,,A,(I - Xj(X,Xj)-iX,) = 0. (25) 

Thus the choices for Bj j are the vectors in the nuUspace of Aj(I — Xj(XjXj)~^Xj). 

A. Unrecovered Coefficients 

Let Yj refer to a basis that spans F"/ spanXj. Now we can rewrite the projection as I — Xj(XjXj)~^Xj = 
Yj(Yj Yj)-^Yj. Now we can say B^jA^ must satisfy 'Bi^jKiYj {Y jYjy^Y j = 0, which reduces to Bj^^ AiYj 
: I, Thus the null space of Ai(I — Xj(XjXj)~^Xj) is the same as the nullspace of A^Yj. A^Yj is a (n — fc) x 
(fc — 1) (n — fc — 1) matrix, so its nullity is at least (n — fc) — (fc — l)(n — fc — 1) or l + (n — fc — 1)(2 — fc). However, 
if2<fc<n— 1, this bound does not force the nulhty to be positive. This bound does explain why it is so easy 
to find codes when fc = 2. 



B. Example: Obtaining B from Y 

Now we can see how the B^j- vectors were discovered in the previous example. Let Yi = ( 1 ). 
Note that AiYf = as required. We apply Yf to the other Aj matrices and find the Bj^ vectors that satisfy 
BijAiYj = 0. 
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For the n = 4, A; = 2 case, nearly all choices for Yj produce a working code. This is not the case for larger 
coefficients. 

V. Transformations of codes and equivalence classes 

A. Row transformations 

Suppose we have an invertible (n — fc) x (n — k) matrix T and a working code defined by Aj and Bj_j matrices. 
Then the matrices TAj and Bj^T"^ also define a working code. For recoverabiUty we have 
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and for independence we have 
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(35) 
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The row transformation is applied to the A matrices from the left and the rotation matrix in a rotationally symmetric 
code is apphed from the right. Thus, applying the transformation to a rotationally symmetric code results in another 
rotationally symmetric code that uses the same rotation matrix. We can define codes to be equivalent if they are 
related by a row transformation. Testing only one code from each equivalence class reduces the search space by 
}p' dimensions. 

B. Column transformations 

The same technique can be applied to the columns. If we have an invertible k{n — k) x k{n — k) matrix T and 
a working code defined by Aj and Bj ^ matrices, then the matrices A^T and Bj j also define a working code. For 
recoverability we have 
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and for independence we have 
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In a rotationally symmetric code, the column transformation and the rotation are both applied from the right, so 
they interact. 



AiT = AR'T 

= ATT-^R*T 
= AT(T-iRT)* 



(41) 
(42) 
(43) 



So the new code is rotationally symmetric with a different rotation matrix, T~ RT. This means that we can use 
a simple rotation matrix when searching for codes and simultaneously check aU rotationally synmietric codes that 
use similar rotation matrices. 

This also makes it possible to put any rotationally symmetric code into systematic form. When a code is in 
systematic form, the first k storage matrices can be stacked to form an identity matrix. 

/ AiT \ 

: =1 (44) 

Finding the transformation that puts a code into systematic form is simple. It is simply the inverse of the stack of 
first k storage matrices. 




C. Example: Systematic Form 
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The same B vectors as before will work for recovery. 
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The same C matrix as before will also work. 
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VI. Search Procedure 



When searching for codes of a given n and k over a finite field, this procedure was used. Iterate over A matrices 
in a way that ensures that exactly one matrix from each row transformation equivalence class is produced. For 
each A matrix, produce the collection of n A; matrices using a single simple column rotation matrix. Then test 
the independence condition. Test it before the recovery condition because it requires only A^ matrices. If the 
independence condition is met, iterate over the space of potential additional recovered coefficients. For each Xj 
matrix produced by this process, check the recovery condition. If the condition is met, this is a code. 



A. n = A,k = 2 

These coefficients are small enough to all several fields to be searched exhaustively. We have searched the prime 
fields up to GF(13). There are no rotationally symmetric codes in GF{2), but in all larger fields codes are extremely 
easy to find. In all of these fields, nearly all of the potential codes that satisfy the independence condition also 
satisfy the recovery condition. As the field size increases, larger and larger fractions of the potential codes satisfy 
the independence condition. In GF{3), 22% of potential codes satisfy the independence condition, and of these all 
satisfy the recovery condition. In GF(13), 78% of potential codes satisfy the independence condition and of these 
92% also satisfy the recovery condition. 

B. n = 5, k = 3 

For these coefficients, codes were not previously known. We have exhaustively searched GF{2), GF{3), GF{A), 
and GF{5) and randomly searched in larger fields for rotationally symmetric codes. We found codes in GF{3), 
GF{4), GF{7), and larger fields, but none in GF{2) or GF{5). While the codes we have found in smaller fields 
are not composed of vectors in general position, we found a code in G'i^(17) that is. Several of these codes are 
given in the appendix. The full descriptions can be found in ||8l. 

C. n — 6, k ~ 3 

For these coefficients. We have yet to find any codes. In GF{3), only about 1% of potential codes satisfy the 
independence condition. In GF{4:) this number is about 14% and in GF{5) it is about 30%. 
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VII. Search results 
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VIII. Appendix 

A. (5,3) code over GF{3) in systematic form 
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B. (5,3) code over GF{7) in systematic form 
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